Constant Time Clash Detection in protein Folding
نویسندگان
چکیده
Applications for the manipulation of molecular structures are usually computationally intensive. Problems like protein docking or ab-initio protein folding need to frequently determine if two atoms in the structure collide. Therefore, an efficient algorithm for this problem, usually referred as clash detection, can greatly improve the application efficiency. This work focus mainly on the ab-initio protein folding problem. A naive approach for the clash problem, the most commonly-used by molecular structure programs, consists in calculating the distance between every pair of atoms. We propose an efficient data structure that uses a three-dimensional array to store the atoms' position. We compare the proposed data structure with one of the best known general data structures for this type of problems (SAT tree) and with the naive approach. While the naive approach takes linear time to the number of atoms to verify if a new atom clashes with any previously-set atoms, the proposed data structure takes constant time to perform the same verification. The SAT tree takes logarithmic time for the same task. The results show that the proposed data structure surpasses the other techniques for any protein size. The proposed data structure takes near half of the time of the SAT data structure and close to a fifth of the time of the naive approach for the larger proteins. We believe that this data structure can improve the existing molecular structure applications by decreasing the computational cost needed for clash detection. The data structure presented in this work can be used for any protein structure clash verification, as long as the atoms that need to be checked are kept in the 3D array. This data structure is particulary useful when manipulating large sets of atoms, for example, in applications like loop prediction, structure refinement of large proteins, and protein docking.
منابع مشابه
An efficient clash detection method for molecular structures applications
Molecular structures applications are usually computationally intensive. Problems like protein docking, ab-initio protein folding and even some visualization software that verify structure correctness, need to constantly determine if two atoms in the structure collide. This problem is usually referred as clash detection. If for the ab-initio and docking problems the cost of these computations d...
متن کاملEnergy Study at Different Temperatures for Active Site of Azurin in Water, Ethanol, Methanol and Gas Phase by Monte Carlo Simulations
The interaction between the solute and the solsent molecules play a crucial role in understanding the various molecular processes involved in chemistry and biochemistry, so in this work the potential energy of active site of azurin have been calculated in solvent by the Monte Carlo simulation. In this paper we present quantitative results of Monte Carlo calculations of potential energies of ...
متن کاملProtein Stability, Folding, Disaggregation and Etiology of Conformational Malfunctions
Estimation of protein stability is important for many reasons: first providing an understanding of the basic thermodynamics of the process of folding, protein engineering, and protein stability plays important role in biotechnology especially in food and protein drug design. Today, proteins are used in many branches, including industrial processes, pharmaceutical industry, and medical fields. A...
متن کاملSpatial Codes and the Hardness of String Folding Problems (Extended Abstract)
(Extended Abstract) Ashwin Nayak Alistair Sinclair y Uri Zwick z Abstract We present the rst proof of NP-hardness (under randomized polynomial time reductions) for string folding problems over a nite alphabet. All previous such intractability results have required an unbounded alphabet size. These problems correspond to the protein folding problem in variants of the hydrophobic-hydrophilic (or ...
متن کاملSpatial Codes and the Hardness of String Folding Problems ( Extended
(Extended Abstract) Ashwin Nayak Alistair Sinclair y Uri Zwick z Abstract We present the rst proof of NP-hardness (under randomized polynomial time reductions) for string folding problems over a nite alphabet. All previous such intractability results have required an unbounded alphabet size. These problems correspond to the protein folding problem in variants of the hydrophobic-hydrophilic (or ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Journal of bioinformatics and computational biology
دوره 7 1 شماره
صفحات -
تاریخ انتشار 2009